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METHOD AND APPARATUS FOR IMPROVING THE INTELLIGIBILITY 
OF DIGITALLY COMPRESSED SPEECH 



TECHNICAL FIELD 
The invention relates generally to speech processing and, 
more particularly, to techniques for enhancing the 
intelligibility of processed speech, 

BACKGROUND OF THE INVENTION 
Human speech generally has a relatively large dynamic 
range. For example, the amplitudes of some consonant sounds 
(e.g., the imvoiced consonants P, T, S, and F) are often 30 dB 
lower than the amplitudes of vowel sounds in the same spoken 
sentence. Therefore, the consonant sotmds will sometimes drop 
below a listener's speech detection threshold, thus 
compromising the intelligibility of the speech. This problem 
is exacerbated when the listener is hard of hearing, the 
listener is located in a noisy environment, or the listener is 
located in an area that receives a low signal strength. 

Traditionally, the potential unintelligibility of certain 
sounds in a speech signal was overcome using some form of 
amplitude compression on the signal. For example, in one 
prior approach, the amplitude peaks of a speech signal were 
clipped and the resulting signal was amplified so that the 
difference between the peaks of the new signal and the low 



portions of the new signal would be reduced while maintaining 
the signal's original loudness. Amplitude compression, 
however, often leads to other forms of distortion within the 
resultant signal, such as the harmonic distortion resulting 
from flattening out the high amplitude components of the 
signal . In addition, amplitude compression 'techniques tend to 
amplify some undesired low-level signal components (e.g., 
background noise) in an inappropriate manner, thus 
compromising the quality of the resultant signal. 

Therefore, there is a need for a method and apparatus 
that is capable of enhancing the intelligibility of processed 
speech without the undesirable effects associated with prior 
techniques . 

SUMMARY OF THE INVENTION 
The present invention relates to a system that is capable 
of significantly enhancing the intelligibility of processed 
speech. The system first divides the speech signal into 
frames or segments as is commonly performed in certain low bit 
rate speech encoding algorithms, such as Linear Predictive 
Coding (LPC) and Code Excited Linear Prediction (CELP) . - The 
system then analyzes the spectral content of each frame to 
determine a sound type associated with that frame. The 
analysis of each frame will typically be performed in the 



context of one or more other frames surrounding the frame of 
interest. The analysis may determine, for example, whether 
the sound associated with the frame is a vowel sound, a voiced 
fricative, or an unvoiced plosive. 
5 Based on the sound type associated with a particular 

frame, the system will then modify the frame if it is believed 
that such modification will enhance intelligibility. For 
example, it is known that unvoiced plosive sounds commonly 
have lower amplitudes than other sounds within human speech. 

10 The amplitudes of frames identified as including unvoiced 
plosives are therefore boosted with respect to other frames. 
In addition to modifying a frame based on the sound type 
associated with that frame, the system may also modify frames 
surrounding that particular frame based on the sound type 

15 associated with the frame. For example, if a frame of 
interest is identified as including an unvoiced plosive, the 
amplitude of the frame preceding this frame of interest can be 
reduced to ensure that the plosive isn't mistaken for a 
spectrally similar fricative. By basing frame modification 

20 decisions on the type of speech included within a particular 
frame, the problems created by blind signal modifications 
based on amplitude (e.g., boosting all low-level signals) are 
avoided. That is, the inventive principles allow frames to be 



modified selectively and intelligently to achieve an enhanced 
signal intelligibility. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Fig. 1 is a block diagram illustrating a speech 

processing system in accordance with one' embodiment of the 
present invention; 

Fig, 2 is a flowchart illustrating a method for 
processing a speech signal in accordance with one embodiment 
10 of the invention; and 

Figs. 3 and 4 are portions of a flowchart illustrating a 
method for use in enhancing the intelligibility of speech 
signals in accordance with one embodiment of the present 
invention. 

15 DETAILED DESCRIPTION 

The present invention relates to a system that is capable 
of significantly enhancing the intelligibility of processed 
speech. The system determines a sound type associated with 
individual frames of a speech signal and modifies those frames 

20 based on the corresponding sound type. In one approach,, the 
inventive principles are implemented as an enhancement to 
well-known speech encoding algorithms, such as the LPC and 
CELP algorithms, that perform frame-based speech digitization. 
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The system is capable of improving the intelligibility of 
speech signals without generating the distortions often 
associated with prior art amplitude clipping techniques. The 
inventive principles can be used in a variety of speech 
applications including, for example, messaging systems, IVR 
applications, and wireless telephone systems. The inventive 
principles can also be implemented in devices designed to aid 
the hard of hearing such as, for example, hearing aids and 
cochlear implants. 

Fig. 1 is a block diagram illustrating a speech 
processing system 10 in accordance with one embodiment of the 
present invention. The speech processing system 10 receives 
an analog speech signal at an input port 12 and converts this 
signal to a compressed digital speech signal which is output 
at an output port 14. In addition to performing signal 
compression and analog to digital conversion functions on the 
input signal, the system 10 also enhances the intelligibility 
of the input signal for later playback. As illustrated, the 
speech processing system 10 includes: an analog to digital 
(A/D) converter 16, a frame separation unit 18, a frame 
analysis unit 20, a frame modification unit 22, and a 
compression unit 24. It should be appreciated that the blocks 
illustrated in Fig. 1 are functional in nature and do not 



necessarily correspond to discrete hardware elements. In one 
embodiment, for example, the speech processing system 10 is 
implemented within a single digital processing device. 
Hardware implementations, however, are also possible* 
5 With reference to Fig. 1, the analog speech signal 

received at port 12 is first sampled and digitized within the 
A/D converter 16 to generate a digital waveform for delivery 
to the frame separation iinit 18. The frame separation unit 18 
is operative for dividing the digital waveform into individual 

10 time-based frames. In a preferred approach, these frames are 
each about 20 to 25 milliseconds in length. The frame 
analysis unit 20 receives the frames from the frame separation 
unit 18 and performs a spectral analysis on each individual 
frame to determine a spectral content of the frame. The frame 

15 analysis unit 20 then transfers each frame's spectral 
information to the frame modification unit 22, The frame 
modification unit 22 uses the results of the spectral analysis 
to determine a sound type (or type of speech) associated with 
each individual frame. The frame modification unit 22 then 

20 modifies selected frames based on the identified sound types. 
The frame modification unit 22 will normally analyze the 
spectral information corresponding to a frame of interest and 
also the spectral information corresponding to one or more 
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frames surrounding the frame of interest to determine a sound 
type associated with the frame of interest. 

The frame modification unit 22 includes a set of rules 
for modifying selected frames based on the sound type 
5 associated therewith. In one embodiment, the frame 
modification unit 22 also includes rules for modifying frames 
surrounding a frame of interest based on the sound type 
associated with the frame of interest. The rules used by the 
frame modification unit 22 are designed to increase the 

10 intelligibility of the output signal generated by the system 
10 . Thus, the modifications are intended to emphasize the 
characteristics of particular sounds that allow those sounds 
to be distinguished from other similar sounds by the human 
ear. Many of the frames may remain unmodified by the frame 

15 modification unit 22 depending upon the specific rules 
programmed therein. 

The modified and unmodified frame information is next 
transferred to the data assembly unit 24 which assembles the 
spectral information for all of the frames to generate the 

20 compressed output signal at output port 14. The compressed 
output signal can then be transferred to a remote location via 
a communication medium or stored for later decoding and 
playback. It should be appreciated that the intelligibility 



enhancement functions of the frame modification unit 22 of 
Fig, 1 can alternatively (or additionally) be performed as 
part of the decoding process during signal playback. 

In one embodiment, the inventive principles are 
implemented as an enhancement to certain well-known speech 
encoding and/or decoding algorithms, such as the Linear 
Predictive Coding (LPC) algorithm and the Code-Excited Linear 
Prediction (CELP) algorithm. In fact, the inventive 
principles can be used in conjunction with virtually any 
encoding or decoding algorithm that is based upon frame-based 
speech digitization (i.e., breaking up speech into individual 
time-based frames and then capturing the spectral content of 
each frame to generate a digital representation of the 
speech) . Typically, these algorithms utilize a mathematical 
model of human vocal tract physiology to describe each frame's 
spectral content in terms of human speech mechanism analogs, 
such as overall amplitude, whether the frame's sound is voiced 
or unvoiced, and, if the sound is voiced, the pitch of the 
sound. This spectral information is then assembled into a 
compressed digital speech signal. A more detailed description 
of various speech digitization algorithms that can be modified 
in accordance with the present invention can be found in the 
paper ^'Speech Digitization and Compression" by Paul Michaelis, 
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International Encyclopedia of Ergonomics and Human Factors, 
edited by Waldamar Karwowski, published by Taylor & Francis, 
London, 2000, which is hereby incorporated by reference. 

In accordance with one embodiment of the invention, the 
5 spectral information generated within such algorithms (and 
possibly other spectral information) is used to determine a 
sound type associated with each frame. Knowledge about which 
sound types are important for intelligibility and are 
typically harder to hear is then used to develop rules for 

10 modifying the frame information in a manner that increases 
intelligibility. The rules are then used to modify the frame 
information of selected frames based on the determined sound 
type. The spectral information for each of the frames, 
whether modified or unmodified, is then used to develop the 

15 compressed speech signal in a conventional manner (e.g., the 
manner typically used by the LPC, CELP, or other similar 
algorithms) . 

Fig, 2 is a flowchart illustrating a method for 
processing an analog speech signal in accordance with one 
20 embodiment of the present invention. First, the speech signal 
is digitized and separated into individual frames (step 3 0) . 
A spectral analysis is then performed on each individual frame 
to determine a spectral content of the frame (step 32) . 

9 



Typically, spectral parameters such as amplitude, voicing, and 
pitch (if any) of sounds will be measured during the spectral 
analysis. The spectral content of the frames is next analyzed 
to determine a sound type associated with each frame {step 
5 34) . To determine the sound type associated with a particular 
frame, the spectral content of other fratnes surrounding the 
particular frame will often be considered. Based on the sound 
type associated with a frame, information corresponding to the 
frame may be modified to improve the intelligibility of the 

10 output signal (step 36) . Information corresponding to frames 
surrounding a frame of interest may also be modified based on 
the sound type of the frame of interest. Typically, the 
modification of the frame information will include boosting or 
reducing the amplitude of the corresponding frame. However, 

15 other modification techniques are also possible. For example, 
the reflection coefficients that govern spectral filtering can 
be modified in accordance with the present invention. The 
spectral information corresponding to the frames, whether 
modified or unmodified, is then assembled into a compressed 

2 0 speech signal (step 38) . This compressed speech signal, can 
later be decoded to generate an audible speech signal having 
enhanced intelligibility. 
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Figs. 3 and 4 are portions of a flowchart illustrating a 
method for use in enhancing the intelligibility of speech 
signals in accordance with one embodiment of the present 
invention. The method is operative for identifying unvoiced 
fricatives and voiced and unvoiced plosives within a speech 
signal and for adjusting the amplitudes * of corresponding 
frames of the speech signal to enhance intelligibility. 
Unvoiced fricatives and unvoiced plosives are sounds that are 
typically lower in volume in a speech signal than other sounds 
in the signal. In addition, these sounds are usually very 
important to the intelligibility of the underlying speech. A 
voiced speech sound is one that is produced by tensing the 
vocal cords while exhaling, thus giving the sound a specific 
pitch caused by vocal cord vibration. The spectrum of a 
voiced speech sound therefore includes a fundamental pitch and 
harmonics thereof. An unvoiced speech sound is one that is 
produced by audible turbulence in the vocal tract and for 
which the vocal cords remain relaxed. The spectrum of an 
unvoiced speech signal is typically similar to that of white 
noise . 

With reference to Fig. 3, an analog speech signal is 
first received (step 50) and then digitized (step 52) . The 
digital waveform is then separated into individual frames 



(step 54) . In a preferred approach, these frames are each 
about 20 to 25 milliseconds in length. A frame-by- frame 
analysis is then performed to extract and encode data from the 
frames, such as amplitude, voicing, pitch, and spectral 
5 filtering data (step 56) , When the extracted data indicates 
that a frame includes an unvoiced fricative, the amplitude of 
that frame is increased in a manner that is designed to 
increase the likelihood that the loudness of the sound in a 
resulting speech signal exceeds a listener's detection 

10 threshold (step 58) . The amplitude of the frame can be 
increased, for example, by a predetermined gain value, to a 
predetermined amplitude value, or the amplitude can be 
increased by an amount that depends upon the amplitudes of the 
other frames within the same speech signal, A fricative sound 

15 is produced by forcing air from the lungs through a 
constriction in the vocal tract that generates audible 
turbulence. Examples of unvoiced fricatives include the "f" 
in fat, the "s" in sat, and the ^^ch" in chat. Fricative 
sounds are characterized by a relatively constant amplitude 

20 over multiple sample periods. Thus, an unvoiced fricative can 
be identified by comparing the amplitudes of multiple 
successive frames after a decision has been made that the 
frames correspond to unvoiced sounds. 

12 



When the extracted data indicates that a frame is the 
initial component of a voiced plosive, the amplitude of the 
frame preceding the voiced plosive is reduced (step 60) . A 
plosive is a sound that is produced by the complete stoppage 
and then sudden release of the breath. ■ Plosive sounds are 
thus characterized by a sudden drop in amplitude followed by a 
sudden rise in amplitude within a speech signal. An example 
of voiced plosives includes the ^'b" in bait, the ^^d" in date, 
and the "g" in gate. Plosives are identified within a speech 
signal by comparing the amplitudes of adjacent frames in the 
signal. By decreasing the amplitude of the frame preceding 
the voiced plosive, the amplitude "spike" that characterizes 
plosive sounds is accentuated, resulting in enhanced 
intelligibility. 

When the extracted data indicates that a frame is the 
initial component of an unvoiced plosive, the amplitude of the 
frame preceding the unvoiced plosive is decreased and the 
amplitude on the frame including the unvoiced plosive is 
increased (step 62) . The amplitude of the frame preceding the 
unvoiced plosive is decreased to emphasize the amplitude 
''spike" of the plosive as described above. The amplitude of 
the frame including the initial component of the unvoiced 
plosive is increased to increase the likelihood that the 

13 



loudness of the sound in a resulting speech signal exceeds a 
listener's detection threshold. 

With reference to Fig. 4, a frame -by -frame reconstruction 
of the digital waveform is next performed using, for example, 
5 the amplitude, voicing, pitch, and spectral filtering data 
(step 64). The individual frames are then "concatenated into a 
complete digital sequence (step 66) . A digital to analog 
conversion is then performed to generate an analog output 
signal (step 68) . The method illustrated in Figs, 4 and 5 can 

10 be performed all at one time as part of a real-time 
intelligibility enhancement procedure or it can be performed 
in multiple sub-procedures at different times. For example, 
if the method is implemented within a hearing aid, the entire 
method will be used to transform an input analog speech signal 

15 into an enhanced output analog speech signal for detection by 
a user of the hearing aid. In an alternative implementation, 
steps 50 through 62 may be performed as part of a speech 
signal encoding procedure while steps 64 through 68 are 
performed as part of a subsequent speech signal decoding 

2 0 procedure. In another alternative implementation, steps 50 
through 56 are performed as part of a speech signal encoding 
procedure while steps 58 through 68 are performed as part of a 
subsequent speech decoding procedure. In the period between 

14 



the encoding procedure and the decoding procedure, the speech 
signal can be stored within a memory unit or be transferred 
between remote locations via a communication channel. In a 
preferred implementation, steps 50 through 56 are performed 
5 using well-known LPC or CELP encoding techniques. Similarly, 
steps 64 through 68 are preferably performed using well-known 
LPC or CELP decoding techniques. 

In a similar manner to that described above, the 
inventive principles can be used to enhance the 

10 intelligibility of other sound types. Once it has been 
determined that a particular type of sound presents an 
intelligibility problem, it is next determined how that type 
of sound can be identified within a frame of a speech signal 
(e.g., through the use of spectral analysis techniques and 

15 comparisons between adjacent frames) , It is then determined 
how a frame including such a sound needs to be modified to 
enhance the intelligibility of the sound when the compressed 
signal is later decoded and played back. Typically, the 
modification will include a simple boosting of the amplitude 

20 of the corresponding frame, although other types of frame 
modification are also possible in accordance with the present 
invention (e.g., modifications to the reflection coefficients 
that govern spectral filtering) . 

15 



An important feature of the present invention is that 
compressed speech signals generated using the inventive 
principles can usually be decoded using conventional decoders 
(e.g., LPC of CELP decoders) that have not been modified in 
5 accordance with the invention • In addition, decoders that 
have been modified in accordance with the 'present invention 
can also be used to decode compressed speech signals that were 
generated without using the principles of the present 
invention. Thus, systems using the inventive techniques can 

10 be upgraded piecemeal in an economical fashion without concern 
about widespread signal incompatibility within the system. 

Although the present invention has been described in 
conjunction with its preferred embodiments, it is to be 
understood that modifications and variations may be resorted 

15 to without departing from the spirit and scope of the 
invention as those skilled in the art readily understand. 
Such modifications and variations are considered to be within 
the purview and scope of the invention and the appended 
claims . 
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What is claimed is : 

1. A method for processing a speech signal comprising the 
steps of: 

receiving a speech signal to be processed; 
dividing said speech signal into multiple frames; 
analyzing a frame generated in said' 'dividing step to 
determine a sound type associated with said frame; and 
modifying said frame based on said sound type. 

2. The method claimed in claim 1, wherein: 

said step of analyzing includes performing a spectral 
analysis on said frame to determine a spectral content of said 
frame . 

3. The method claimed in claim 2, wherein: 

said step of analyzing includes examining said spectral 
content of said frame to determine whether said frame includes a 
voiced or unvoiced sound. 
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4. The method claimed in claim 1, wherein: 

said step of analyzing includes determining an amplitude of 
said frame and comparing said amplitude of said frame to an 
amplitude of a previous frame to determine whether said frame 
includes a plosive sound. 

5. The method claimed in claim 1, wherein: 

said step of modifying said frame includes changing an 
amplitude of said frame when said frame is determined to include 
a first sound type, 

6. The method claimed in claim 5, wherein: 

said step of modifying said frame includes boosting an 
amplitude of said frame when said frame is determined to include 
an unvoiced plosive. 

7. The method claimed in claim 1, wherein: 

said step of modifying said frame includes changing a 
parameter associated with said frame in a manner that enhances 
intelligibility of an output signal. 
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8. The method claimed in claim 1, further comprising the 
step of: 

modifying another frame based on said soimd type detennined 
in said step of analyzing. 

9. The method claimed in claim 8, wherein: 

said step of modifying another frame includes reducing an 
amplitude of a previous frame when said sound type is an 
unvoiced plosive. 

10. A computer readable medium having program instructions 
stored thereon for implementing the method of claim 1 when 
executed within a digital processing device. 

11. A method for processing a speech signal comprising the 
steps of: 

providing a speech signal that is divided into time-based 
frames; 

analyzing each of said frames in the context of surrounding 
frames; and 

adjusting an amplitude of selected frames based on a result 
of said step of analyzing. 
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12. The method of claim 11, wherein: 

said step of adjusting includes adjusting the amplitude of 
a frame in a manner that enhances intelligibility of an output 
signal . 

13. The method of claim 11, wherein: 

said step of analyzing includes determining a sound type 
associated with a first frame. 

14. The method of claim 13, wherein: 

said step of adjusting includes increasing the amplitude of 
said first frame when said sound type associated with said first 
frame includes an unvoiced plosive. 

15. The method of claim 13, wherein: 

said step of adjusting includes increasing the amplitude of 
said first frame when said sound type associated with said first 
frame includes an unvoiced fricative. 



16. The method of claim 13, wherein: 

said step of adjusting includes decreasing the amplitude of 
a second frame that is previous to said first frame when said 
sound type associated with said first frame includes a voiced or 
unvoiced plosive. 

17. The method of claim 11, wherein: 

said step of analyzing includes comparing an amplitude of a 
first frame to an amplitude of a frame previous to said first 
frame, 

18. A computer readable medium having program instructions 
stored thereon for implementing the method claimed in claim 11 
when executed in a digital processing device. 

19. A system for processing a speech signal comprising: 
means for receiving a speech signal that is divided into 

time-based frames; 

means for determining a sound type associated with each of 
said frames; and 

means for modifying selected frames based on sound type to 
enhance signal intelligibility. 



20 • The system claimed in claim 19, wherein: 
said system is implemented within a linear predictive 
coding { LPC ) encoder . 

21. The system claimed in claim 19, wherein: 

said system is implemented within a code excited linear 
prediction (CELP) encoder. 

22. The system claimed in claim 19, wherein: 

said system is implemented within a linear predictive 
coding (LPC) decoder. 

23, The system claimed in claim 19, wherein: 

said system is implemented within a code excited linear 
prediction (CELP) decoder. 

24, The system claimed in claim 19, wherein: 

said means for determining includes means for performing a 
spectral analysis on a frame. 
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25. The system claimed in claim 19, wherein: 

said means for determining includes means for comparing 
amplitudes of adjacent frames. 

26. The system claimed in claim 19, wherein: 

said means for determining includes means for ascertaining 
whether a frame includes a voiced or unvoiced sound. 

27. The system claimed in claim 19, wherein: 

said means for modifying includes means for boosting the 
amplitude of a frame that includes a sound type that is 
typically less intelligible than other sound types. 

28. The system claimed in claim 19, wherein: 

said means for modifying includes means for boosting the 
amplitude of a frame that includes an unvoiced plosive. 

29. The system claimed in claim 19, wherein: 

said means for modifying includes means for reducing the 
amplitude of a frame that precedes a frame that includes an 
unvoiced plosive. 
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30. The system claimed in claim 19, wherein: 
said means for determining a sound type includes means for 
determining whether a frame includes at least one of the 
following: a vowel sound, a voiced fricative, an unvoiced 
fricative, a voiced plosive, and an unvoiced 'plosive. 
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ABSTRACT 

A system for processing a speech signal to enhance signal 
intelligibility identifies portions of the speech signal that 
include sounds that typically present intelligibility problems 
5 and modifies those portions in an appropriate manner. First, 
the speech signal is divided into a plurality of time -based 
frames. Each of the frames is then analyzed to determine a 
sound type associated with the frame. Selected frames are then 
modified based on the sound type associated with the frame or 

10 with surrounding frames. For example, the amplitude of frames 
determined to include unvoiced plosive sounds may be boosted as 
these sounds are known to be important to intelligibility and 
are typically harder to hear than other sounds in normal speech. 
In a similar manner, the amplitudes of frames preceding such 

15 unvoiced plosive sounds can be reduced to better accentuate the 
plosive. Such techniques will make these sounds easier to 
distinguish upon subsequent playback. 
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DIGITIZE ANALOG SPEECH 
SIGNAL AND SEPARATE INTO 
INDIVIDUAL FRAMES 
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PERFORM SPECTRAL ANALYSIS ON EACH 
INDIVIDUAL FRAME TO DETERMINE 
SPECTRAL CONTENT THEREOF 



ANALYZE SPECTRAL CONTENT OF 
FRAMES TO DETERMINE A SOUND TYPE 
ASSOCIATED WITH EACH FRAME 



BASED ON SOUND TYPE, MODIFY SELECTED 
FRAME INFORMATION TO IMPROVE 
INTELLIGIBILITY OF OUTPUT SIGNAL 



GENERATE COMPRESSED SPEECH SIGNAL 
USING MODIFIED AND UNMODIFIED 
FRAME INFORMATION 



Fig. 2 



INPUT ANALOG SPEECH SIGNAL 
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PERFORM ANALOG TO DIGITAL 
CONVERSION ON ANALOG SPEECH SIGNAL 
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SEPARATE DIGITAL WAVEFORM INTO FRAMES 
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PERFORM FRAME-BY-FRAME ANALYSIS TO EXTRACT 
AND ENCODE DATA, SUCH AS AMPLITUDE, VOICING, 

AND PITCH DATA 
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WHEN THE EXTRACTED DATA INDICATES THAT A FRAME IS AN 
UNVOICED FRICATIVE, INCREASE THE AMPLITUDE OF THE FRICATIVE 



WHEN THE EXTRACTED DATA INDICATES THAT A FRAME IS 
THE INITIAL COMPONENT OF A VOICED PLOSIVE, DECREASE 
THE AMPLITUDE OF THE FRAME PRECEDING THE PLOSIVE 
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WHEN THE EXTRACTED DATA INDICATES THAT A FRAME IS 

THE INITIAL COMPONENT OF AN UNVOICED PLOSIVE, 
DECREASE THE AMPLITUDE OF THE FRAME PRECEDING THE 
PLOSIVE AND INCREASE THE AMPLITUDE OF THE PLOSIVE 
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Fig. 3 
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PERFORM FRAME-BY-FRAME RECONSTRUCTION OF THE 
DIGITAL WAVEFORM USING, FOR EXAMPLE, AMPLITUDE, 
VOICING, AND PITCH DATA 
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CONCATENATE INDIVIDUAL FRAMES INTO COMPLETE 

DIGITAL SEQUENCE 
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PERFORM DIGITAL-TO-ANALOG CONVERSION 
TO GENERATE ANALOG OUTPUT SIGNAL 
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Fig. 4 
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IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 

DECLARATION AND POWER OF ATTORNEY 

As the below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor of the subject matter which is claimed and 
for which a patent is sought on the invention entitled METHOD AND APPARATUS FOR 
IMPROVING THE INTELLIGIBILITY OF DIGITALLY COMPRESSED SPEECH the 

specification of which is attached hereto. 

I hereby state that I have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by an amendment, if any, specifically referred to in 
this oath or declaration. 

I acknowledge the duty to disclose all information known to me which is material to 
patentability as defined in title 37, Code of Federal Regulations, L56. 

I hereby claim foreign priority benefits under Title 35, United States Code, 119 of any 
foreign application (s) for patent or inventor's certificate listed below and have also identified below 
any foreign application for patent or inventor's certificate having a filing date before that of the 
application on which priority is claimed: 

None 

I hereby claim the benefit under Title 35, United States Code, 120 of any United states 
application(s) listed below and, insofar as the subject matter of each of the claims of this application 
is not disclosed in the prior United States application in the manner provided by the first paragraph 
of Title 35, United States Code, 1 12, 1 acknowledge the duty to disclose all information known to 
me to be material to patentability as defined in Title 37, Code of Federal Regulations, 1.56 which 
became available between the filing date of the prior application and the national or PCT 
international filing date of this application: 

None 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and fiirther that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 
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I hereby appoint the following attomey(s) with full power of substitution and revocation, to 
prosecute said application, to make alterations and amendments therein, to receive the patent, and 
to transact all business in the Patent and Trademark Office connected therewith: 



Lester H. Bimbaum (Reg. No. 25830) 

Richard J. Botos (Reg. No. 32016) 

Jeffery J. Brosemer (Reg. No. 36096) 

Kenneth M. Brown (Reg. No. 37590) 

Donald P. Dinella (Reg. No. 39961) 

Guy Eriksen (Reg. No. 41736) 

Martin I. Finston (Reg. No. 3 1 6 1 3) 

James H. Fox (Reg. No. 29379) 

William S. Francos (Reg. No. 38456) 

Barry H. Freedman (Reg. No. 26 1 66) 

Julio A. Garceran (Reg. No. 37 1 3 8) 

Mony R. Ghose (Reg. No. 3 8 1 59) 

Jimmy Goo (Reg. No. 36528) 

Anthony Grillo (Reg. No. 36535) 

Stephen M. Gurey (Reg. No. 27336) 

John M. Harman (Reg. No. 38 173) 

John W. Hayes (Reg. No. 33900) 

Michael B. Johannesen (Reg. No. 35557) 

Mark A. Kurisko (Reg. No. 38944) 

Irena Lager (Reg. No. 39260) 

Christopher N. Malvone (Reg. No. 34866) 

Scott W. McLellan (Reg. No. 30776) 

Martin G. Meder (Reg. No. 34674) 

Geraldine Monteleone (Reg. No. 40097) 

John C. Moran (Reg. No. 30782) 

Michael A. Morra (Reg. No. 28975) 

Gregory J. Murgia (Reg. No. 41209) 

Claude R. Narcisse (Reg. No. 38979) 

Joseph J. Opalach (Reg. No. 36229) 

Neil R. Ormos (Reg. No. 35309) 

Eugen E. Pacher (Reg. No. 29964) 

Jack R. Penrod (Reg. No. 3 1 864) 

Daniel J. Piotrowski (Reg No. 42079) 

Gregory C. Ranieri (Reg. No. 29695) 

Scott J. Rittman (Reg. No. 390 10) 

Eugene J. Rosenthal (Reg. No. 36658) 

Bruce S. Schneider (Reg. No. 27949) 

Ronald D. Slusky (Reg. No. 26585) 

David L. Smith (Reg. No. 30592) 

Patricia A. Verlangieri (Reg. No. 42201) 
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John P. Veschi 



(Reg. No. 29355) 
(Reg. No. 29355) 
(Reg. No. 27407) 
(Reg. No. 36304) 
(Reg. No. 17765) 



David Volejnicek 
Charles L. Warren 
Jeffrey M. Weinick 
Eli Weiss 



I hereby appoint the attomey(s) on ATTACHMENT A as associate attomey(s) in the 
aforementioned application, with full power solely to prosecute said application, to make alterations 
and amendments therein, to receive the patent, and to transact all business in the Patent and 
Trademark Office connected with the prosecution of said application. No other powers are granted 
to such associate attomey(s) and such associate attomey(s) are specifically denied any power of 
substitution or revocation. 



Full name of sole inventor (or 1st joint inventor): Paul Roller Michaelis 

Inventor's % j ^ ^ ^ 

signature / A Date jV^ ^ 

Residence: Louisville, Colorado 

Citizenship: USA 



Post Office Address: 1918 Quail Circle 

Louisville, CO 80027 
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ATTACHMENT A 



David F. Zinger 
Craig C. Groseth 
Michael L. Tompkins 
Sabrina C. Stavish 
Todd P. Blakely 
Lewis D. Hansen 
Joseph E. Kovarik 
Gary J. Connell 
Wannell M. Crook 
Douglas W. Swartz 
Bruce A. Kugler 
Robert R. Bnmelli 
Tejpal S. Hansra 
Dana L. Hartje 
Don D. Cha 
Angela Dallas-Pedretti 
Benjamin B. Lieb 
Bradley M. Knepper 
Theresa A. Brown 
John C. Scott 
Miriam R. Drickman 



(Reg. No. 29127) 
(Reg. No. 31713) 
(Reg. No. 30980) 
(Reg. No. 33374) 
(Reg. No. 31328) 
(Reg. No. 35536) 
(Reg. No. 33005) 
(Reg. No. 32020) 
(Reg. No. 31071) 
(Reg. No. 37739) 
(Reg. No. 38942) 
(Reg. No. 39617) 
(Reg. No. 38172) 
(Reg. No. 40638) 
(Reg. No. 40945) 
(Reg. No. 42460) 
(Reg. No. 42801) 
(Reg. No. 44189) 
(Reg. No. 32547) 
(Reg. No. 38613) 
(Reg. No. 42499) 



Telephone calls should be made to Sheridan Ross P.C. at: 



Phone No.: (303) 863-9700 
Fax No.: r303^ 863-0223 



All written communications are to be addressed to: 



Douglas W. Swartz 
Sheridan Ross P.C. 
Suite 1200 
1560 Broadway 
Denver, CO 80202-5141 
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